British Library Catalogue Dataset in RDF/XML
From the British Library Metadata Services team:
Identifier BritishLibraryRdfCreator British Library Metadata ServicesMediatype dataLicenseurl http://creativecommons.org/publicdomain/zero/1.0/Publicdate 2010-11-16 00:30:43Addeddate 2010-11-15 22:23:45Backup_location ia903607_9
Use of Data: This data is being made available under a Creative Commons CC0 1.0 Universal Public Domain Dedication licence. This means that the British Library Board makes no copyright, related or neighbouring rights claims to the data and does not apply any restrictions on subsequent use and reuse of the data.
Support: The British Library is committed to providing high quality services and accurate data. If you have any queries or identify any problems with the data please contact firstname.lastname@example.org.
Share knowledge: The British Library are also very interested to hear the ways in which you have used this data so they can understand more fully the benefits of sharing it and improve their services. Please contact
email@example.com if you wish to share your experience!
Please, give credit where credit is due: The British Library has a responsibility to maintain its bibliographic data on the nation’s behalf. Please credit all use of this data to the British Library and link back to www.bl.uk/bibliographic/datafree.html in order that this information can be shared and developed with today’s Internet users as well as future generations.
There's more information about the British Library Catalogue Dataset in the Usage Guide PDF
You might also be interested in this post from Ben Osteen, Characterising the British Library Bibliographic dataset
, over on the Open Knowledge Foundation's Open Biblio blog
November 18, 2010
Some tips for processing
There is a small transfer problem with some of the files such that there is a lowercase rdf:description used at points. A global search and replace can fix this - for example "sed -s -i 's/rdf\:description/rdf:Description/g' BNBrdfdc*"
The records themselves are represented as blank nodes with no explicit URI so processing them can be more awkward than expected.
There is some information on this dataset on the openbiblio.net website, one post that may help is this one: http://openbiblio.net/2010/11/17/augmenting-the-british-librarys-rdf-data-to-allow-for-disambiguation/